From Simple Stochastic Games to Bisimulation Pseudometrics on Markov Decision Processes

نویسندگان

  • Franck van Breugel
  • James Worrell
چکیده

In this paper we investigate the complexity of computing bisimulation pseudometrics on Markov decision processes (MDPs). Our first main result is that such pseudometrics can be computed in the complexity class PPAD. We show that another well-known problem in PPAD—computing the value of a simple stochastic game (SSG)— can be reduced in logarithmic space to the problem of computing the bisimulation pseudometric on a given MDP. In the other direction, we reduce the problem of computing the bisimulation pseudometric to that of computing the value of an SSG. This reduction uses a construction similar to the classical attacker-defender game for bisimulation in the non-probabilistic case, and works in polynomial time for MDPs of a fixed branching degree. Finally, we investigate whether the above bound on the branching degree can be dropped, relating it to the question of whether there is a family of polynomial size SSGs that solve the linear assignment problem.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Algorithms for Game Metrics (Full Version)

Simulation and bisimulation metrics for stochastic systems provide a quantitative generalization of the classical simulation and bisimulation relations. These metrics capture the similarity of states with respect to quantitative specifications written in the quantitative μ-calculus and related probabilistic logics. We present algorithms for computing the metrics on Markov decision processes (MD...

متن کامل

Parameterized Metatheory for Continuous Markovian Logic

This paper shows that a classic metalogical framework, including all Boolean operators, can be used to support the development of a metric behavioural theory for Markov processes. Previously, only intuitionistic frameworks or frameworks without negation and logical implication have been developed to fulfill this task. The focus of this paper is on continuous Markovian logic (CML), a logic that ...

متن کامل

ar X iv : 0 80 9 . 43 26 v 2 [ cs . G T ] 9 O ct 2 00 8 Algorithms for Game Metrics ( Full Version

Simulation and bisimulation metrics for stochastic systems provide a quantitative generalization of the classical simulation and bisimulation relations. These metrics capture the similarity of states with respect to quantitative specifications written in the quantitative μ-calculus and related probabilistic logics. We present algorithms for computing the metrics on Markov decision processes (MD...

متن کامل

Algorithms for Game Metrics

Simulation and bisimulation metrics for stochastic systems provide a quantitative generalization of the classical simulation and bisimulation relations. These metrics capture the similarity of states with respect to quantitative specifications written in the quantitative μ-calculus and related probabilistic logics. We present algorithms for computing the metrics on Markov decision processes (MD...

متن کامل

Simulation-Based Graph Similarity

We present symmetric and asymmetric similarity measures for labeled directed rooted graphs that are inspired by the simulation and bisimulation relations on labeled transition systems. Computation of the similarity measures has close connections to discounted Markov decision processes in the asymmetric case and to perfect-information stochastic games in the symmetric case. For the symmetric cas...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013